Dialogue act classification using a Bayesian approach∗

نویسنده

  • Sergio Grau
چکیده

In this work, we make a contribution to natural speech dialogue act detection. We focus our attention on the dialogue act classification using a Bayesian approach. Our classifier is tested on two corpora, the Switchboard and the Basurde tasks. A combination of a naive Bayes classifier and n-grams is used. The impact of different smoothing methods (Laplace and Witten Bell) and n-grams in classification are studied. With respect to the Switchboard corpus, an accuracy of 66% is achieved using a uniform naive Bayes classifier, 3-grams and Laplace smoothing to avoid zero probabilities. For the Basurde corpus, our system achieves performances similar to other methodologies we have previously tested. Through a combination of a naive Bayes classifier with 2-grams and Witten Bell smoothing we achieve the best accuracy of 89%. These results show that a Bayesian approach is well suited for these tasks.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Bayesian Approach to Dialogue Act Classification

The paper proposes a probabilistic approach to the interpretation of natural language utterances in terms of dialogue acts. Bayesian networks can be used to combine partial linguistic information with knowledge about the state of the dialogue and about the speaker, in order to find the most probable dialogue act performed, using standard probabilistic inference in the network. The proposed appr...

متن کامل

Dialogue Act Modelling Using Bayesian Networks

A probabilistic approach to interpretation of natural language utterances in terms of dialogue acts is proposed. It is illustrated how using Bayesian Networks, partial information obtained from an NLP component can be combined with knowledge the agent has about the state of the dialogue and about the user, in order to find the most probable dialogue act made.

متن کامل

Using Fuzzy LR Numbers in Bayesian Text Classifier for Classifying Persian Text Documents

Text Classification is an important research field in information retrieval and text mining. The main task in text classification is to assign text documents in predefined categories based on documents’ contents and labeled-training samples. Since word detection is a difficult and time consuming task in Persian language, Bayesian text classifier is an appropriate approach to deal with different...

متن کامل

Dynamic Bayesian networks and variable length genetic algorithm for designing cue-based model for dialogue act recognition

The automatic recognition of dialogue act is a task of crucial importance for the processing of natural language dialogue at discourse level. It is also one of the most challenging problems as most often the dialogue act is not expressed directly in speaker’s utterance. In this paper, a new cue-based model for dialogue act recognition is presented. The model is, essentially, a dynamic Bayesian ...

متن کامل

Using Fuzzy LR Numbers in Bayesian Text Classifier for Classifying Persian Text Documents

Text Classification is an important research field in information retrieval and text mining. The main task in text classification is to assign text documents in predefined categories based on documents’ contents and labeled-training samples. Since word detection is a difficult and time consuming task in Persian language, Bayesian text classifier is an appropriate approach to deal with different...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004